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Abstract 

A  backpacker  approaches  a  road  with  a  marker  on  it,  desirous  of  finding  the  marker  but  having  only  a  rough  idea  of 
where  it  is  located.  It  is  well  known  among  backpackers  that  it  is  best  to  aim  either  right  or  left  of  the  marker,  since 
otherwise  it  will  not  be  clear  which  way  to  turn  upon  reaching  the  road.  The  problem  of  deciding  exactly  where  to  aim 
can  be  formalized  as  a  modification  of  the  Linear  Search  Problem.  This  paper  does  so,  and  also  discusses  dynamic 
programming  as  a  solution  method. 
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1.  Introduction 

The  Linear  Search  Problem  (LSP),  as  posed  by  Bellman  [4]  and  Beck  [2],  is  one  where 
a  searcher  attempts  to  locate  a  random  point  X  in  one  dimension,  starting  from  a  given  point  and 
moving  at  constant  speed  until  the  searcher’s  location  first  coincides  with  X  at  some  time  T.  The 
problem  is  to  find  the  searcher  path  that  minimizes  E(T ),  the  mean  time  to  find  X.  Here  we 
consider  a  modification  of  the  problem  where  the  starting  point  can  be  chosen  by  the  searcher. 
Since  the  motivation  for  this  is  most  easily  stated  in  terms  of  a  backpacker  approaching  a  road  with 
a  marker  on  it,  we  refer  to  the  problem  as  the  backpacker’s  LSP,  or  BLSP.  Certain  search  and 
rescue  problems  could  be  modeled  as  BLSPs,  or  see  [1]  for  additional  applications.  The  essential 
assumptions  are  that: 

(1)  An  object  is  located  on  a  line  according  to  a  known  probability  distribution. 

(2)  The  searcher  starts  at  such  a  long  distance  from  the  line  that  his  time  of  arrival  on  it  is 
essentially  independent  of  the  point  of  intersection;  that  is,  the  time  and  place  at  which  search  starts 
on  the  line  are  not  related. 

(3)  Searching  is  at  unit  speed  and  in  some  sense  expensive  per  unit  time,  but  there  are  no  fuel 
constraints  nor  any  special  time  by  which  the  target  must  be  found.  These  assumptions  justify  use 
of  E(T  )  as  the  quantity  to  be  minimized. 
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Assumption  3  is  important.  If  minimizing  E(T )  were  replaced  by  the  equally  reasonable 
objective  of  maximizing  the  probability  of  detecting  the  target  by  some  fixed  time,  then  the  optimal 
track  for  the  searcher  would  involve  at  most  one  turning  point  instead  of  the  infinite  sequence 
typical  of  the  LSP. 

As  stated  so  far,  the  BLSP  is  equivalent  to  Balkhi’s  “generalized”  linear  search  problem  [1],  the 
subject  of  Sections  2  and  3  following.  In  Section  4  we  will  introduce  the  additional  feature  that  the 
backpacker  may  choose  to  put  down  his  heavy  pack  while  he  searches  for  X.  In  that  form,  the 
BLSP  is  being  considered  here  for  the  first  time. 


2.  Analysis 

Define  a  search  plan  (SP)  to  be  a  sequence  (.y„)  for  which  •  ^  .X4  ^  ^  Xq  ^  Xi  ^  X3  ^  , 

the  understanding  being  that  the  searcher  proceeds  at  unit  speed  first  from  Xq  to  Xi  and  then  to  the 
other  points  in  the  order  of  the  index  on  .x,  a  zig-zag  path  moving  initially  to  the  right.  In  principle 
one  should  also  consider  search  plans  that  move  initially  to  the  left,  but  we  will  not  do  so  in  order  to 
avoid  tedious  notational  details.  This  definition  is  the  same  as  Beck’s  [2],  except  that  here  Xo  is  not 
required  to  be  0.  The  search  time  T  is  the  first  time  when  the  searcher’s  location  coincides  with  X, 
the  location  of  the  target. 

Whether  an  optimal  SP  exists  or  not  depends  on  the  nature  of  the  distribution  of  the  random 
variable  X.  We  will  assume  throughout  that  X  has  a  bounded  density  /( )  with  associated 
cumulative  distribution  F( ),  that  0  <  f(x)  <  1  for  all  .x,  and  that  £(1X1)  <oc  .  In  this  case  an 
optimal  SP  exists  if  Xq  is  fixed,  as  shown  in  [2,  5].  Let  K(xo)  be  the  minimum  expected  time  to  find 
the  target  if  the  searcher  is  required  to  start  at  Xq.  Then  certainly  \K{x)  —  /C(>')1  ^  |x  —  yl,  since  the 
searcher  could  always  move  from  one  point  to  the  other  as  a  first  step.  Therefore  X  ( )  is  continuous. 
Also  X(x)  ^  £(|X  —  x|),  since  the  searcher  must  cover  at  least  the  distance  |X  —  x|.  Since 
|X  —  .x|  ^  |x|  —  |X|,  it  follows  that  lim^-  x  X(x)  =  lim^  , -  x  K{x)  —cc,  and  therefore  that  the 
continuous  function  K(  )  must  have  a  minimum.  Thus  an  optimal  SP  exists,  and  the  question 
becomes  one  of  computation.  Balkhi  [1]  has  shown  the  existence  of  optimal  search  plans  under 
more  general  assumptions  about  F{ ). 

The  sequence  {x„;  n^O)  partitions  the  real  line  into  intervals,  one  of  which  must  contain  X. 
Given  the  interval,  there  is  a  simple  relation  between  T  and  X.  If  X4  <  X  ^  X2,  for  example,  then 
T  =  (x,  —  Xq)  +  (xi  —  X2)  +  (X3  —  X2)  +  (X3  —  X).  By  accounting  for  all  intervals  in  which 
X  might  lie  and  taking  expectations,  it  can  be  shown  (as  in  [8]  or  [3])  that 
F{T)  =  £(|X  —  Xol)  —  2.Xo£(xo)  +  2zl(x),  where 

A(x)  =  X  (1) 

i  =  1 

and  where 

^  _  |l  +  £(x,-i)  —  f(x,)  if  i  is  odd,  or 
|£(xi- 1)  —  £(.x,)  —  1  if  /  is  even. 

An  optimal  SP  must  have  the  property  that  (d/dXi)£(r )  =  0.  Since  (d/dxo)£(T)  = 
2(xi  —  Xo)/(.Xo)  —  1,  it  follows  that,  for  an  optimal  SP,  Xj  =  Xq  +  0.5//(xo).  For  i  >  0,  equating 
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(d/dx,)£(T  )  to  0  results  in  the  equation  a',  +  ,  =  v,  —  Hi/j\xi).  Letting  Hq  =  —  0.5,  these  equations 
can  be  written  as 

Xi  +  i  =  X,  -  Hi/fiXi).  i  ^  0.  (2) 

As  in  the  case  of  LSP,  hrst-order  optimality  conditions  determine  x  except  for  Xq,  since  (2)  can  be 
used  to  solve  for  the  x,  sequentially  once  Xq  is  known.  Some  choice  of  Xq  must  produce  an  optimal 
SP.  Other  choices  may  produce  a  sequence  that  is  not  an  SP  (numbers  do  not  diverge  from  Xq)  or 
which  is  simply  not  optimal.  Also  in  the  LSP  it  holds  that  the  optimal  SP  is  the  minimal  SP  that 
satisfies  the  analog  of  (2);  larger  starting  points  produce  a  sequence  that  diverges  too  quickly  to 
infinity  [3,  8]. 


3.  Solution  in  the  case  of  the  unit  normal 

Suppose  that  F(x)  =  <f>(x).  the  standard  normal  distribution,  and  let  y,  =  (—  1)'^^X;  for  i  5=  0. 
Since  0{-  x)  =  I  -  0{x)  for  all  x,  //,  =(-  1)'(2  -  0(y,)  -  i))  for  i  >  0,  and  therefore  (2) 

becomes 

y,  + 1  =  -  Vi  +  (2  -  0{\\)  -  <?(>',- 1 ))/(() (I’i),  /  >  0.  (3) 

with 

yi  =  -  3'o  +  O.5/0(yo)  (4) 

where  (p(x)  =  (d/dx)<f»(x)  is  the  density  function  of  X. 

Eqs.  (3)  are  identical  to  those  of  the  LSP  [8].  The  requirement  that  >’o  be  0  in  the  LSP  is  replaced 
by  (4)  in  the  BLSP.  Experimentation  with  these  equations  using  the  erf(x,  'high’)  function  of 
MATLAB  [6]  produces  the  SP  (y,)  =  (1.571 183469417,  2.7352,  3.7259,  4.6046,  5.4040,  6.1435, 
6.8357, . . .),  with  associated  E{T  )  —  2.1630.  Vq  is  given  to  more  decimal  places  than  the  other 
turning  points  because  the  sequence  produced  by  (3)  and  (4)  will  change  if  even  the  last  digit  of  yo  is 
altered.  Smaller  starting  points  result  in  the  sequence  eventually  starting  to  decrease,  and  larger 
starting  points  result  in  an  early  increase  to  MATLAB's  “infinity”.  Rousseeuw  [8]  reports  that 
E(T )  is  2.903665  when  the  searcher  starts  at  the  origin,  so  the  benefit  of  being  able  to  start  at 
a  favorable  point  in  the  BLSP  is  substantial.  The  probability  of  finding  the  target  in  the  interval 
[“  >’o^  >’i]  is  0{2.7352)  —  0{  —  1.5712)  =  0.94,  so  a  searcher  using  this  SP  will  hardly  ever  need  to 
turn  at  yq. 


4.  Accounting  for  the  pack 

A  backpacker  has  his  choice  of  two  methods  for  getting  his  pack  to  the  marker  at  X.  Method  A  is 
to  simply  carry  his  pack  until  he  gets  to  the  marker.  Method  B  is  to  remove  the  pack  upon  first 
encountering  the  road,  find  the  marker,  return  to  the  pack,  and  then  carry  the  pack  to  the  known 
location  of  the  marker.  Assuming  that  the  distance  traveled  is  (say)  K  times  as  painful  when 
carrying  a  pack  as  without  one,  which  method  should  be  used  if  the  object  is  to  minimize  total 
expected  pain,  and  how  does  the  choice  of  method  influence  the  starting  point  on  the  road? 
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Let  P  be  total  pain  from  first  contact  with  the  road  until  the  pack  is  first  located  at  the  marker. 
Then  F  =  K  T  for  method  A,  so  the  minimal  value  of  E(P)  is  2.1630K,  and  the  best  search  plan  for 
method  A  is  as  outlined  above.  With  method  B,  F  —  7  +  (K  +  1)|A'  —  XqI,  since  the  distance  from 
X  to  the  pack  needs  to  be  traversed  twice  after  X  is  located,  once  with  the  pack  and  once  without. 
Therefore  E{P)  =  {K  +  2)E{\X  —  XqI)  —  2xoF(.\'o)  +  2A(x}  for  method  B,  assuming  as  before  that 
the  optimal  SP  proceeds  first  to  the  right.  With  F  =  0,  the  first-order  conditions  for  optimality  still 
include  (3),  but  equation  (d/dxo)£(F)  =  0  now  leads  to  the  equation 

>’i  =  ((A  +  1)(1  -  0{xo))  -  0.5X)/<^(.Xo)  -  v'o.  (5) 

For  each  K  ^  1.  the  optimal  SP  can  be  found  as  before  by  experimenting  with  yo-  For 
K  =  (2,  2.706,  3),  the  best  values  of  >’o  are  (0.152632, 0.099808, 0.086843),  and  the  minimal  expected 
pain  for  method  B  is  (5.2854,  5.8530,  6.0887).  When  K  =  2.706  the  two  methods  have  the  same 
expected  pain  so  our  backpacker  should  use  method  B  in  preference  to  method  A  if  and  only  if 
carrying  the  pack  is  more  than  2.706  times  as  painful  as  not  carrying  it. 

Actually,  the  best  search  plan  may  be  neither  of  these  two  methods.  It  may  be  optimal  for  the 
searcher  to  carry  the  pack  for  several  nonadjacent  intervals  of  time,  putting  it  down  at  the  end  of 
each  interval.  It  may  even  be  that  an  optimal  search  plan  does  not  exist,  unlikely  as  that  may  seem 
given  known  results  for  the  closely  related  LSP.  The  option  of  proving  that  one  exists  in  order  to 
justify  the  application  of  first-order  conditions  to  a  problem  with  no  convenient  analytic  form  and 
many  more  variables  than  the  LSP  is  not  attractive.  In  the  next  section  we  will  shown  how  dynamic 
programming  can  be  applied  instead. 


5.  Dynamic  programming 

In  this  section  we  will  solve  a  discrete  version  of  the  problem  described  at  the  end  of  Section  4, 
where  the  backpacker  has  the  option  of  carrying  the  pack  at  some  times  but  not  at  others.  The 
marker  will  be  located  on  the  road  in  accordance  with  a  normal  distribution  with  mean  0  and 
standard  deviation  a.  The  reason  for  considering  u  ^  1  is  that  the  searcher  will  be  restricted  in  what 
follows  to  making  turns  or  changing  the  status  of  the  backpack  only  at  integer  points  on  the  line. 
By  making  (T»1  we  can  presumably  diminish  the  effect  of  this  integrality  assumption. 

Let  b{M)  be  a  bound  on  the  average  amount  of  travel  required  to  find  the  marker,  given  that  the 
marker  is  known  to  be  outside  of  the  interval  [—  M,  M]  and  that  the  searcher  is  at  M.  The  bound 
used  in  all  of  the  calculations  reported  below  is  developed  in  the  Appendix.  Consider  then  the  finite 
action  space  —  M, . . . ,  M,  where  M  is  some  positive  integer.  The  searcher  first  chooses  an  integer 
in  this  space,  then  moves  back  and  forth  within  the  action  space  until  either  the  marker  is  found  or 
it  has  been  established  that  the  marker  is  not  in  [—  M,  M].  In  the  latter  case  the  searcher  must 
return  to  the  backpack,  carry  it  to  the  nearest  endpoint,  and  then  suffer  an  additional  pain  that  is 
Kb(M  ),  on  the  average,  before  finally  getting  the  backpack  to  the  marker.  All  of  these  restrictions 
on  the  searcher’s  motion  have  the  effect  of  making  £(F)  as  calculated  in  this  section  be  an  upper 
bound  on  what  the  expected  pain  would  be  without  the  restrictions. 

We  observe  that,  at  all  integer  times  at  which  an  action  is  possible,  the  searcher  will  have  cleared 
a  certain  interval  [/,  /],  with  the  searcher  being  at  one  end  of  the  interval  and  the  backpack  located 
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somewhere  within  it.  Any  point  that  the  searcher  has  visited  has  been  “cleared".  The  set  of  cleared 
points  is  always  an  interval  regardless  of  the  searcher’s  path;  this  is  the  basic  observation  that 
makes  a  (reasonably  simple)  dynamic  program  possible.  The  “state”  of  the  situation  at  decision 
times  is  k),  where  /  is  the  location  of  the  searcher,  k  is  the  integer  location  of  the  backpack,  and 
i  is  the  location  of  the  other  end  of  the  cleared  interval  (we  permit  i  >  j,  in  which  case  the  cleared 
interval  is  [7,  /]  and  j  ^  k  ^  i).  When  the  searcher  reverses  course,  he  must  decide  what  state  to 
visit  next.  That  decision  amounts  to  selecting  and  k',  each  at  least  as  far  away  from  j  as  the 
corresponding  unprimed  quantity.  In  fact,  we  insist  that  /'  be  strictly  farther  from  j  than  i  than  i  is, 
since  a  strategy  that  reversed  course  at  i  and  then  /  and  then  i  again  could  be  improved.  If  the  latter 
point  is  not  obvious,  then  take  the  requirement  to  be  yet  another  restriction  on  the  searcher’s 
motion.  The  requirement  forces  the  cleared  set  to  strictly  increase  with  each  reversal,  and  will  also 
permit  a  dynamie  programming  recursion  that  can  be  solved  in  one  pass. 

Let  k)  be  the  minimal  expected  additional  pain  required  to  find  the  target,  given  that  the 
target  is  known  not  to  be  in  the  interval  [i,  /]  or  [7,  i],  whichever  is  appropriate,  and  that  there  will 
be  a  course  reversal  at  j,  the  current  location  of  the  searcher.  Take  H{  —  M,M,k)  to  be 
{K  +  1)(M  —  k)  +  kh{M),  for  —  M  ^  k  ^  M.  This  assignment  corresponds  to  the  idea  that  the 
searcher  returns  to  the  backpack,  brings  it  to  M,  and  then  carries  out  the  search  described  in  the 
Appendix.  Since  H{iJ,  k)  =  H{  —  i,  —  j,  —  k)  by  symmetry,  H(M,  —  M,  —  k)  is  also  defined.  Let 
Q{i,j)he  the  a  priori  probability  that  X  is  not  in  the  cleared  interval.  Then,  assuming  for  clarity  that 
the  desired  recursive  formula  for  H{i,j.k)  can  be  obtained  by  accounting  for  the  two 
possibilities  that  X  is  either  in  the  interval  [/',  /],  in  which  case  the  search  will  terminate,  or  outside 
the  interval  [/',  /],  in  which  case  the  next  state  will  be  (  /,  k‘): 


k)  =  min 

k'  <ik.  i' 


p{u)f(u)du  +  [(,/  -  /'  +  (X  -  l)(k  -  k'))  +  H{j,  i',  k')] 


Qji'J) 

QiUj) 


(6) 


where/(u)  =  (t^(u/<y)/[aQ{iJ)'\  is  the  density  of  X,  given  that  X  is  not  in  [kj],  and  p(u)  is  the  pain  in 
getting  from  /  to  u  and  possibly  returning  to  k'  to  retrieve  the  pack: 


{ ij  —  k)  -j-  K(k  —  k  )  A-  {2  K  ) {k  —  u )  if  u  ^  k  , 
[  (,/  “  k)  +  K{k  —  u)  if  w  5s  k' . 


(7) 


Letting  /i(/,j,  k)  =  H(iJ,  k)Q(i,  j),  (6)  can  be  expressed  somewhat  more  compactly  as 


h{i,pk)=  min 

k'  ^k.  r  <  i 


p(u)  [0(u/a)/ff]  d»  +  Q{i\  j )  [,/  -  /  +  (K  -  l)(k  -  k' )]  +  h{j,  r,  k'] 


(8) 


Notice  that  the  k-evaluations  on  the  right-hand  side  of  (8)  have  arguments  such  that 
N  —  j\  >  I'  —  j\-  Thus  the  computations  of  h  can  be  organized  so  that  the  size  of  the  eleared  set  is 
reduced  in  stages  from  2M  to  2M  —  1,  etc.,  until  finally  h{i,  i,  i)  is  computed  for  —  M  ^  ^  M  in 
the  final  stage  where  the  size  of  the  cleared  set  is  0.  In  the  final  stage  (and  only  in  the  final  stage)  the 
direction  of  movement  cannot  be  obtained  by  comparing  the  first  two  arguments  of  h.  In  an 
unsymmetric  situation  both  directions  should  be  tried.  The  computations  reported  below  simply 
utilize  (8),  which  amounts  to  requiring  that  the  searcher  first  travel  to  the  left. 
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The  integrals  involved  in  (8)  can  be  simplified  by  observing  that 


u\_(j)(u/(j)/(7^du  =  E(ci)  —  E{h) 

a 


(9) 


where  £(x)  =  a(j){x/G). 


6.  Results  of  the  dynamic  program 

M  =  20  and  a  =  5  were  used  in  all  cases,  with  small  (1  or  2  seconds  on  the  Naval  Postgraduate 
School  mainframe)  associated  computation  times. 

The  pain  factor  K  was  first  set  to  1  to  confirm  the  computations  reported  in  Section  3.  The 
smallest  total  pain  was  2.1641(t,  achieved  by  always  carrying  the  pack  and  making  turns  at 
—  2.0<t,  4.0a,  and  —  4.0a,  followed  by  the  procedure  described  in  the  Appendix  if  necessary.  These 
numbers  are  close  enough  to  those  of  Section  3  to  be  consistent  with  the  idea  that  the  FORTRAN 
implementation  was  in  fact  written  correctly,  as  well  as  to  confirm  that  not  too  much  has  been 
given  away  in  making  restrictions  on  the  searcher's  path. 

The  pain  factor  K  was  next  set  to  2.706,  the  "swing  factor”  of  Section  4.  The  smallest  total  pain 
was  5.1357(7,  with  the  optimal  turn  schedule  being!  —  0.4(7.  —  1.6(7.  3(7,  —  4(7, 4(7)  and  the  backpack 
schedule  being  ( —  0.4a,  —  0.4a,  1.4(7,  —  2.8<7.  4a).  Thus  the  backpacker  should  go  to  —  0.4a,  leave 
the  backpack  there  while  he  goes  left  to  -  1.6(7.  pick  it  up  and  carry  it  to  1.4(7  on  his  way  to  3(7,  etc. 
To  be  more  precise,  a  backpacker  who  follows  that  schedule  is  guaranteed  that  the  expected  pain 
will  not  exceed  5.1357(7.  that  being  the  smallest  value  possible  with  the  restrictions  that  have  been 
imposed.  Comparison  of  these  results  with  those  of  Section  4  reveals  that  neither  method  A  nor 
method  B  is  optimal  when  K  =  2.706;  the  optimal  way  of  searching  is  more  complicated  than  either 
of  them. 

The  fact  that  the  best  starting  point  is  negative  when  K  =  2.706  is  somewhat  surprising  upon 
first  acquaintance,  since  the  searcher  is  constrained  to  move  left  initially.  Sufficiently  surprising,  in 
fact,  to  have  caused  the  author  to  spend  considerable  time  trying  to  find  the  bug  in  the  FORTRAN 
program  that  produced  it.  But  in  fact  a  searcher  who  is  constrained  to  first  go  left  has  a  dilemma. 
He  can  aim  to  the  right  of  the  marker,  keep  the  pack  on,  and  hope  that  his  travel  will  be  short 
because  he  will  first  travel  through  the  heart  of  the  normal  distribution.  Alternatively,  he  can  aim 
left,  take  the  pack  off,  explore  a  large  part  of  the  left  side  of  the  normal  distribution  in  relative 
comfort,  and  then  be  reasonably  sure  that  the  next  time  he  puts  the  pack  on  will  be  the  last.  The 
first  alternative  is  best  when  K  =  1,  but  the  second  turns  out  to  be  superior  when  K  =  2.706  (or 
presumably  for  any  larger  value). 

Fig.  1  shows  graphs  of  h{i,  i,  i)  for  K  —  K  ~  2,  and  K  =  3.  For  K  =  1,  the  left  minimum  is 
slightly  higher  than  the  right  minimum  only  because  the  searcher  is  constrained  to  move  at  least 
one  unit  to  the  left.  Starting  at  —  8  (the  —  1.6(7  point),  for  example,  the  searcher  grudgingly  moves 
to  —  9  before  beginning  his  favorite  tactic  of  searching  to  the  right.  Without  the  constraint,  the  left 
half  would  be  symmetric  to  the  right.  When  K  =  3  however,  the  asymmetry  of  the  left  and  right 
halves  is  not  caused  by  the  constraint  of  moving  initially  to  the  left,  but  rather  by  different  tactics 
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for  minimization  as  described  in  the  previous  paragraph.  When  K  is  smaller  than  a  number  that  is 
about  2,  the  best  tactic  is  to  initially  retain  the  pack  and  move  towards  the  center.  When  K  is  larger 
than  that  number,  the  best  tactic  is  to  initially  put  down  the  pack  and  move  away  from  the  center. 
The  left  minimum  for  K  =  2  corresponds  to  the  latter  tactic,  and  the  right  minimum  to  the  former. 
The  left  and  right  minima  are  nearly  equal  when  K  =  2. 


7.  Summary 

Dynamic  programming  is  a  reasonable  competitor  to  analytic  methods  for  problems  such  as  the 
LSP  and  the  BLSP.  It  achieves  less  accuracy  per  unit  of  computational  effort,  but  requires  less 
theorem  proving,  computes  an  approximation  that  is  also  a  bound,  and  handles  multiple  local 
optima  with  ease.  It  is  adaptable  to  variations  such  as  backpacks  as  long  as  the  basic  assumption 
that  permits  a  simple  state  representation  is  not  disturbed.  That  assumption  in  the  case  of  the  LSP 
is  that  the  backpacker  cannot  possibly  pass  by  the  marker  without  recognizing  it,  hence  the  interval 
of  “cleared"  points. 

It  is  interesting  that  dynamic  programming  could  not  solve  the  original  “variation”  proposed  by 
Bellman  [4],  at  least  not  as  simply.  In  that  variation  the  searcher  only  recognizes  the  marker  with 
some  probability  p  <  1.  so  the  state  would  have  to  include  an  interval  of  points  searched  once 
containing  an  interval  of  points  searched  twice  containing... .  The  “curse  of  dimensionality”  would 
set  in,  since  each  additional  variable  in  the  state  representation  has  a  multiplicative  effect  on 
computation  time.  Solution  of  that  problem  by  dynamic  programming  will  have  to  be  deferred  to 
a  time  when  computers  are  bigger  and  faster. 
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Appendix 

A  bound  on  the  travel  required  in  the  LSP 

A  marker  was  originally  located  according  to  a  standard  normal  distribution,  but  is  now  known 
to  be  outside  the  interval  [  —  M,  M],  A  searcher  is  located  at  M,  and  must  travel  at  unit  speed  until 
he  finds  the  marker  at  some  time  T. 


Theorem.  For  M  >  1.  there  is  a  method  of  searching  for  which  E{T  )  <  h{M),  where 


h{M)  =  0.5 


M  +  /  ;V/-  +  (8/71)  1  +  -  1) 


1  —  exp(  -  jM) 


M. 


Proof.  Let  .x,  =  ^  M{M  +  ;)  for  /  ^  0,  an  increasing  sequence  for  which  Xq  =  M.  The  searcher 
proceeds  to  Xi.  —  X2,  x,,  —  .X4, ....  following  the  SP  (x,).  Let  G(x)  =  1  —  (f>(x),  let  c  =  0.5/G(Af), 
and  let  f  (x)  =  c’^'(x)  for  x  ^  —  M.  F{x)  =  0.5  for  |x|  ^  M,  and  F(x)  =  1  —  F(  —  x)  for  x  >  M.  F(x) 
is  the  CDF  of  the  location  of  the  marker,  given  that  it  is  not  in  the  interval  [  —  M,  M].  According  to 
Eq,  (1),  for  this  plan 

E(T  )  =  E{\X  -  Ml)  -  M  +  2  X  .x,.(2  -  F(.v,)  -  F(.x,_ ,))  (A.l) 

i  =  1 

where  the  symmetry  of  F  has  been  utilized  in  writing  the  sum.  The  expected  value  £(|X  —  M|)  when 
computed  according  to  the  distribution  F  is  0(M)/G(,M).  Since  1  -  F(x,)  =  F(—  x,)  =  c^>(—  Xj)  = 
G(Xi)/(2G(M)),  (A.l)  can  be  written  as 

E{T  )  =  ct>(M)/G(M)  -  M  +  I  v,(G(x,)  +  G(x,_  ,)).  (A.2) 

The  ratio  /?(x)  =  G{x)i<p{x)  is  known  as  Mill's  ratio.  It  is  known  [7]  that 

l/(x  +  y  x^  +  (S  t:))  <  K(x)  <  1/x  for  x  >  0.  (A. 3) 

Letting  S  be  the  sum  in  (A.2),  we  therefore  have 

S  ^  X  -  i)0(a-,-  i)) 

i=  1 

=  ^  X  (exp(  -  iM(M  +  /))  +  exp(-iM(M  +  i  -  1))Y  (A.4) 

V''2Tr,  =  iV  \j  M  +  t  -  \  J 

Let  z  =  exp(—  jM)  and  observe  that  ^/(M  +  i)/{M  +  r  —  1)  ^  ^/M/(M  —  1).  Then,  from  (A.4), 
after  factoring  out  exp(  —  jM^)  from  each  of  the  exponentials, 

S  ^  0(M )  X  z'  [  1  +  exp(i M) ) . 


(A.5) 
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The  sum  in  (A. 5)  is  a  geometric  series,  which  sums  in  closed  form  to  give 
E{T  )  ^ 


G{M)\ 


V 


M/(M  -  1 


-  M. 


The  conclusion  follows  upon  applying  the  left-hand  side  of  (A. 3)  to  the  ratio  (/)(M)/G(M). 


(A.6) 

□ 


It  is  clear  that  E(T  )  ^  M  no  matter  what  policy  the  searcher  follows,  since  the  marker  is  located 
to  the  left  of  —  M  half  the  time,  and  in  that  case  the  interval  [  —  M,  M]  of  length  2M  will  have  to 
be  traversed.  Thus  b{M)  is  necessarily  larger  than  M.  It  is  not  difficult  to  show  that 
lim^^  ,  b{M)  —  M  =  0.5.  so  the  bound  is  sharp  when  M  is  large. 
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